Improvements in audio processing and language modeling in the CU communicator

نویسندگان

  • Jianping Zhang
  • Wayne H. Ward
  • Bryan L. Pellom
  • Xiuyang Yu
  • Kadri Hacioglu
چکیده

This paper presents some up-to-date audio processing techniques which have been developed and integrated into the University of Colorado (CU) communicator system. The CU Communicator is an interactive human-machine dialogue system for airline, hotel and rental car information. The baseline system was fully functional in June 1999. Since then, many improvements have been made. The paper will concentrate on acoustic echo cancellation, voice activity detection (VAD) and language modeling techniques and provide a paradigm for speech and audio processing in a dialog system with barge-in capabilities. Specifically, a real-time block least-mean-square (LMS) algorithm is discussed. A robust voice activity detector using energy threshold is applied to detect user voice. Experimental results are presented and some real-time implementation issues are addressed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

University of Colorado Dialogue Systems for Travel and Navigation

This paper presents recent improvements in the development of the University of Colorado “CU Communicator” and “CUMove” spoken dialog systems. First, we describe the CU Communicator system that integrates speech recognition, synthesis and natural language understanding technologies using the DARPA Hub Architecture. Users are able to converse with an automated travel agent over the phone to retr...

متن کامل

University of Colorado Dialog Systems for Travel and Navigation

This paper presents recent improvements in the development of the University of Colorado “CU Communicator” and “CUMove” spoken dialog systems. First, we describe the CU Communicator system that integrates speech recognition, synthesis and natural language understanding technologies using the DARPA Hub Architecture. Users are able to converse with an automated travel agent over the phone to retr...

متن کامل

The CU communicator: an architecture for dialogue systems

This paper presents our recent work towards development of the University of Colorado (CU) Communicator, an interactive dialogue system for airline, hotel and rental car information. The CU Communicator integrates speech recognition, synthesis and natural language understanding technologies using the DARPA Hub Architecture to allow users to converse with an automated travel agent. During a typi...

متن کامل

A word graph interface for a flexible concept based speech understanding framework

In this paper, we introduce a word graph interface between speech and natural language processing systems within a flexible speech understanding framework based on stochastic concept modeling augmented with background ”filler” models. Each concept represents a set of phrases ( written as a context free grammar (CFG)) with the same meaning, and is compiled into a stochastic recursive transition ...

متن کامل

Recent advances in speech recognition system for IBM DARPA communicator

In this paper, we present methods to improve speech recognition performance of the IBM DARPA Communicator system. Our efforts for acoustic modeling include training a domain specific yet broad acoustic model, speaker clustering and speaker adaptation using feature space transforms. For language modeling, we achieved improvements by using compound words, carefully designed LM classes and adjusti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001